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Abstract 

In the current work the effects of hop-by-hop packet loss and retransmissions via ARQ protocols are 
investigated within a Mobile Ad-hoc NET-work (MANET). Errors occur due to outages and a success probability 
function is related to each Unk, which can be controlled by power and rate allocation. We first derive the expression 
for the network's capacity region, where the success function plays a critical role. Properties of the latter as well 
as the related maximum goodput function are presented and proved. A Network Utility Maximization problem 
(NUM) with stability constraints is further formulated which decomposes into (a) the input rate control problem 
and (b) the scheduling problem. Under certain assumptions problem (b) is relaxed to a weighted sum maximization 
problem with number of summants equal to the number of nodes. This further allows the formulation of a non- 
cooperative game where each node decides independently over its transmitting power through a chosen link. Use 
of supermodular game theory suggests a price based algorithm that converges to a power allocation satisfying 
the necessary optimaUty conditions of (b). Implementation issues are considered so that minimum information 
exchange between interfering nodes is required. Simulations illustrate that the suggested algorithm brings near 
optimal results. 
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I. Introduction 

The current work considers a Mobile Ad-hoc NETwork (MANET) where data flows entering from 
a set of source nodes should be routed to their destinations. In such networks a major concern is the 
maximum set of incoming rates that can be supported, since interference is the bottleneck. If a utility 
function is related to each incoming flow a very interesting problem is to maximize the sum of all utilities 
under the constraint that the queues of all nodes remain stable. Such problems have been addressed in 
[1], [2], [3], [4], [5] and algorithms that optimally adapt the incoming rates and the transmission powers 
of each node have been suggested. 

In the current work we are interested in bringing these models a step further and investigate how 
the stability regions and the optimal policies for congestion control, routing and power allocation vary, 
when the queues of each node use ARQ protocols to repeat transmissions of erroneous packets due 
to outages. In the current literature, investigations already addressing the network utility maximization 
(NUM) problem with erroneous transmissions through the links consider mainly fixed routing. In [6] the 
model does not consider queueing aspects and a NUM problem with rate-outage constraints per link is 
approximately solved. In [7] the effect of end-to-end error probability is included in the utility of each 
source. The same problem with average power and reliability requirements is posed and algorithmically 
solved in [8]. Furthermore, in [9] a single hop ad-hoc network with outages is considered where a 
solution for joint admission control, rate and power allocation is derived based on a stochastic game 
formulation. Other contributions that investigate the effect of retransmissions in MANETs incorporating 
Random Access MAC protocols include [10], [11]. 

Motivated by a comment in [12] where it is stated that "in practical communication systems, the link 
capacity should be defined appropriately, taking packet loss and retransmissions into account, hence the 
flow conservation law holds for goodputs instead of rates", and after a presentation of the model under 
study in Section n, we derive in Section IE the goodput capacity region. The success probability for the 
transmission over a wireless link depends on the entire power allocation and the scheduled transmission 
rate. We constrain our investigations to functions with specific properties presented in section IV, where 
it is shown that these also hold for the expression with Rayleigh fading [13]. The NUM problem 
naturally decomposes in Section V into the input rate control and the scheduling problem. 

At this point a major challenge is to achieve a decentralized solution of the second problem. This 
is always possible of course for the case of parallel channels (see also [12] and [14]). Algorithmic 
suggestions can be found in [2] for zero-full power allocations and in [15] by solving a maximum 
weighted matching problem over a conflict graph. In our work fully distributed implementation is 
achieved by approaching the second problem with the arsenal of supermodular game theory in section 
VI - an idea appearing in [16] and [17] - and result to the suggestion of a price based algortithm in 
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VI.D and VI.E that achieves almost optimal solutions with minimum information exchange between the 
nodes. Simulation results are finally presented in section VII. 

II. Model under Study 

We consider a wireless network consisting of nodes H = {1, . . . , A^}, while C is the set of all 
possible L = N ■ {N — \) links. The time is divided into slots of equal duration T (normalized to 1) 
and t = {0, 1, . . .} is the time index. Data flows enter the network at source nodes and are removed at 
destination nodes V = {1, . . . , D}. The set of data flows {commodity flows) injected into the network at 
a source node with a predefined destination is denoted by 5 = {1, . . . , S"}. The routes of the data flows 
through the network are not fixed. Furthermore, each link / G £ is characterized by an origin node h {I) 
(begin) and an end node e (/) (end). At each node n, a total of D buffers - one for each commodity 
flow - are reserved (see also Fig{T])- 

In the general case investigated, each node n E M chooses at slot t a power pi (t) as well as a rate 
/i; (t) to transmit data through link /, as long as n = b(l). The total transmission rate of packets through 
link I at some time slot t is the sum of the transmission rates of the individual commodities sharing 
the link meaning /i; (t) = J2d=i /^f (0- Scheduled packets of variable length /if (t) for each commodity 
d are combined into a common packet of length jii (t) and sent through the link. The resulting long 
packet may be received at node e (/) with errors due to fading and interference. The probability of 
successful transmission is then a real valued function q {p, /i) : R^^^ x ^ [0, 1] of the entire 
power allocation at slot t, p{t) G 11 C ]R^'*^^~^^ and the scheduled rate /i; (t). The set of all possible 
scheduling rates is denoted by A4. The nodes have power restrictions, e.g. Vra, Ylii-n=h{i)Pi i^) — 
and n is the convex compact set of all feasible power allocations. 

Examples of such success probability functions for flat block-fading channels can be found using the 
outage probability definition [18]. Given an SINRi threshold value 7 (/i;) = e^' — 1 of link / (we often 
simply write 7; := 7 (/x,)) 

qi {p, fii) = P {SINRi (p) > e^^ - 1) , SINRi (p) = 2 (D 

where Yij stands for Yb[j)e{i), fJ^i is the scheduled transmission rate through the link, Gh(j)e(i) is the 
slow varying path gain and Fh(j)e(i) is the associated flat fading component of the channel. For the 
case of Rayleigh/Rayleigh fading (meaning Rayleigh slow fading for both the desired and interference 
signals), a closed form expression of ^ can be found in [13] and [19] 
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Observe that the success functions used imply that only the channel fast fading statistics are known and 
the nodes have no other instantaneous channel state information (CSI) over the fading gains, except - 
possibly - the slow varying path gains. The actual amount of data transmitted through each link equals 
jii it) ■ Xi [t). Xi [t) is a binary random variable which equals 1 for success (with prob. qi) and for 
failure (with prob. 1 — qi). The expected transmission rate through link I is then 



gi {p, fii) := Hi ■ qi {p, fii) (3) 

and is called the goodput of link / [20], [21]. Furthermore, in the analysis that follows we often encounter 
a quantity called maximum goodput defined as (see [20] and [22] for parallel Rayleigh fading channels) 



gi {p) = max fii ■ qi {p, fn) . (4) 

In case a packet of length /i; (t) is received at node e (/) with errors, we assume that this can always 
be detected during decoding. When reception is correct an ACK is fed back otherwise a NAK signal 
is transmitted to b (/) via a reliable zero-delay wireless feedback link. In the latter case the packets of 
all transmitted commodities are then not removed from the buffer but wait for a future retransmission 
(Stop-and-Wait ARQ) under some new scheduling decision. The queue evolution for each node n and 
commodity flow d at slot t, is given by 



< {t + 1) 



k:n=b{k) 



+ J2 f^f{t)Xi{t)+at{t+l). (5) 

l:n=e{l) 



The success probability of the transmission through link / is equal for all commodities d, since it 
depends on the sum rate fii. In the expression ([5]), J2k-n=b{k) ^^k {^) ^k (t) is the actual outgoing data 
("actual" meaning "error free") from node n, X]z n=e(0 /^f (^) -^i ('^) actual incoming data from 

links I e C : n = e{l), n ^ d and {t + 1) is the amount of commodity d bits arriving exogenously 
to the network at node n during t. 

We associate each incoming flow to the network at node n with destination d E V, a'^ = as, with 
a utility function Us : M+ — > M+. The utility function takes as argument the average incoming data 
rate E (as) = Xs and is non-decreasing, strictly concave and continuously differentiable over the range 
Xs > (elastic traffic, [23]). The utilities describe the satisfaction received by transmitting data from 
node s E S to d eV. 

The aim here is to find an incoming rate vector x = (xi, . . . , xs) to maximize the sum of the utilities 
J2ses i^s) subject to the constraint that the system remains stable and furthermore explicitly provide 
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the stabilizing scheduling policy Vx G A. A denotes the capacity region of the system, the largest set 
of X for which the system remains stable. Formally we write 



max Us {xs) subject to x G A. (6) 

s&S 

III. Network Capacity Region and Variations with Dropping Packet Decisions 

The problem posed so far is similar to the models investigated in [2], [3], [4] and [15]. Due to the 
occurence of errors and the use of retransmissions, the capacity region of the model under investigation 
is definitely reduced and has a different expression compared to the works mentioned. 

Theorem 1 The capacity region A of the wireless network under study is the set of all non-negative 
vectors x = (xi, . . . ,xs) such that there exist multicommodity goodput flow variables {gf}'!^^, satis- 
fying 

• gf>0, Wle C,deV and gf = Oife{l)=d 

• ^d£V gf < gi' g = {gi} ^ r, where r = co (r^ and 

f = I J < ^ G ^+^^~^^ : G £, 5f/ < fii (p) ■ qi {p, Jii (p)) , fJ-i (p) = arg max ■ qi {p, fn) \ . (7) 

pGn 

Proof: Similar to the derivation of the network capacity region in [2] and can be found in [24]. 

■ 

In the above {s'f is the D - N ■ (N —1) size vector of goodput flow variables for all commodities 
through the network. An optimal policy achieving stability for all vectors within A is a variation of the 
well-known backpressure policy [1], [2] where goodputs replace the rate vectors. This is named here 
goodput backpressure policy. We further denote with T the goodput region of the network, which equals 
the convex hull (co) of f given in (|7]). Comparing this region to the ones appearing in [2] and [14] 
the rate-power mapping r/ (p) = log (1 + SINRi (p)) is replaced here by the maximum goodput-power 
mapping gi (p). 

Let us now assume that the nodes can decide, in addition to the transmission power pi and rate pi 
over the link I E C : n = b{l), whether the possibly erroneous packet at time slot t should be dropped 
or should be held in the node's queues and wait to be retransmitted at the next time slot t + 1. We use 
the binary decision variable Ai (t) taking values Ai (t) = for dropping decision and Ai (t) = 1 for a 
decision to continue. The single queue evolution will be the same as in © where Xi (and similarly 
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Xk) should be replaced by the expression 1 — Ai (t) (1 — Xi (t)) which equals Xi (t) when Ai (t) = 1 
and 1 when Ai (t) =0. 

If the decisions on dropping are randomized, with a fixed probability of dropping per link equal to 
1 — 5i E [0, 1] (and hence EA; (t) = 5i), the network capacity region A^, 5 = {Si, ... , 5l), will be the 
same as in Theorem [U with a modification on the region T. In this case we have that 

= U ^ ^'^"^"'^ ■.^l&C,gi< fif (l - 5, ■ (l - {p, pf))) } (8) 

pen 

fif := fii {p, 5i) = argmax^^gA4 fJ-i ■ {I - Si ■ {1 - qi (p, fn))). Choice of the vector 5 = 1 := (1, . . . , 1) 
results in the region of Theorem [T] where no dropping takes place, while for 5 = := (0, . . . ,0), 
dropping always takes place after an erroneous transmission and this provides the maximum network 
capacity region with Tg equal to 

r,^=o = {9^ ^'^"^^'^ : V/ e £, < /ir} (9) 

where fi* = argmax^jgy^ fii is the maximum allowable transmission rate per link. We can then obtain 
different regions between these two extremes by varying the dropping probabilities per link. To 
understand why this is important suppose that a network user transmitting a data flow with source node 
s E S has a higher data rate than that offered by the actual error free network capacity region A^^j. 
We may then vary the vector 6 so that the network will fit the requirements of the user. Of course the 
average rate of correctly transmitted packets through the network will not change. What will happen 
is that, instead of removing part of the user's packets at entering the network (admission control), the 
network will offer per link at least one chance for all packets to be correctly transmitted through the 
network, hence will be able to provide unreliable service to the entire required high data rate, with 
index of reliablity 5. 

IV. Properties of the success function and the maximum goodput function 

The success probability function qi (p, for transmission over link / E C considered in this work, 
has the following propertiei^]- 

• P.l is strictly increasing in pi and the log of the function is concave in pi 

• P.2 qi is strictly decreasing and convex in p^, \/k ^ l,k E C 

• P.3 qi is strictly decreasing in yU; 

'The game-theoretic notation qi {pi,p-i, /ii) is often used, where p_( is the entire power vector excluding the l-th element pi. 
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• P.4 The log of the function has increasing differences for the pair of variables (p;, meaning that 



logqi {pl,p-i,fii) -logqi {pi,p-i,^ii) < logqi {pl,p^i,fil) -logqi {puP-i,l4) (10) 

where > pi and ji^ > fii. 
• P.5 The log of the function has increasing differences for each pair of variables {puPj) ,Vj 7^ /. 

The differences are constant for all pairs {pi,pj), where i ^ j and i,j E C\{1}. 
The last property actually implies - using [25, Corollary 2.6.1] - that the function is \og- supermodular. 
By property P.4 a positive change on the transmission power pi has a greater impact on the increase of 
the (logarithm of the) success probability, the higher the rate of transmission. If we e.g. transmit with 
16-QAM modulation, an increase of power by Ap; > will increase logg much more than in the case 
of transmission with BPSK. 



Theorem 2 The success probability function for the Rayleigh/Rayleigh fading case, given in (121) satisfies 
properties P.1-P.5. 

Proof: For the proof, the expressions ([TT]) - (fT5l) of first and second order partial derivatives are 
required. Specifically, from ([TT]) and (fT2)) the function is increasing in pi and decreasing in pj (strictly if 
Pi > P™™ > and same for j). From (fT4l) the logarithm of the function is concave in pi. The convexity 
in P.2 comes directly from the partial derivative of ([72)) over pj which is easily shown to be positive. 
P.3 is shown in (fT3l) . whereas P.4 comes directly by derivating (fT3]) w.r.t. pi. Finally, P.5 is a direct 

(see [25, p.42]). ■ 



consequence of the fact that - in d - ^ 'llif'^''^ > and ^ 



dqi jPhP-hfJ-i) ^ 
dpi 

dqi {pi,p^i,fii) ^ 
dpj 

dqi {pi,p-i,fii) 

logqi {PhP-hl^i) 
dpf 

logqi {puP-uHi) 
dpidpj 



dpidpj 
qi {pi,p_i,fii) 



idpj 



GiiP? 



+ E 



-qiipi,p.l,fxi) ■ aj^ 



> 
< 



qi {pi,p-i,i^i) 

Giip't 



GiiPi 



' GijPj 



■j^l Giipi+-yi{fii)GijPj 



-+l 



+Pl 



< 

< 

> 0. 



(11) 

(12) 

(13) 
(14) 

(15) 



Using the above properties we can derive important properties for the maximum goodput function in 
dH), which as seen in ([7]) plays a critical role in the definition of the system capacity region. 
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Theorem 3 If the success probability function satisfies P.1-P.5 then the maximum goodput function in 
(I?]) has the following properties (where fii {p) = argmax^jg;^ fiiqi {p,fii)) 

• P\l gi ip) is strictly increasing in pi 

• P\2 Qi {p) is strictly decreasing and convex in pk, 'ik ^ I 

• P\3 jii (p) is non-decreasing in pi 

• P\4 fbi {p) is non-increasing in pk, Wk I 

Proof: Proofs of P'.l-P'.4 are found in Appendix A. ■ 
The above properties are illustrated in Figl2] and Figl3] using a success probability function with the 
expression in (O for the 2-user Rayleigh/Rayleigh fading case. These will not be directly used in what 
follows but are rather useful for the characterisation of the stability region and the optimal scheduling 
policies of such systems. Examples of the goodput region are shown in figures Fig j4] and Fig 15] for two 
simple network topologies: 2 transmitting nodes with 1 receiving, 1 transmitting node with 2 receiving. 

Remark 1 In economic terms, we can interpret the success probability function qi as the demand of 
product I in a market of L firms. In this framework fii is the product's price and gi is the firm's revenue. 
Then gi {pi,p^i, /i/) = fiixqi {puP-u l^i) is firm's I (revenue) = (price)x(demand). The demand is by P.3 
a decreasing function of the price, is increasing by PI in pi and decreasing by P.2 in pk, k ^ I. Then 
Pi can be interpreted as a variable valuating product's I quality (or maybe the money spent by firm I in 
advertisement) and pk as the quality (or money for advertisement) of products from competitors k ^ I. 
Then the maximum goodput gi {p) is the maximum revenue that a firm I can obtain by choosing an 
optimal price fii {p), given a vector p. By properties P\l and P\2 the maximum revenue is increasing 
in pi and decreasing in pk, k ^ I, whereas by P'.3 and P\4 the optimal price is also increasing in pi 
and decreasing in pk. Notice that if in PA log-supermodularity would be replaced by log-submodularity 
the optimal price would be a decreasing function of pi. 

V. NUM Problem Dual Decomposition 
The utility maximization problem in ^ given the network capacity region in Th. \T\ takes the form 

^^Xs>0, gf>0 J2seS {Xs) 

subject to E/;e(z)=n at + 4 < Efc;fe(fc)=n dt Vn, d 

geco (f) =r 

and r is given in The constraint set is convex and compact (see [26, Appendix 4.C]), the objective 
function is concave and Slater's condition can be shown to hold, hence strong duality also holds and 
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known distributed algorithms, like the one following, can solve the Lagrange dual problem min^^^ L yXj 
which involves the {N ■ D)-vector A of dual variables . The Lagrangian associated with the primal 
NUM problem is denoted by L (^x, A j while the dual function L ^A j yields, due to the linearity of the 
differential operator (see [23], [4], [15]) 



L{X] = > max Ws (xs) — XsXs] + laax >^ A 



9& 

sdS n,d 




and As = A„, = for the flow s with source node n and destination d. We can interpret A^ as the 
implicit cost per pair (n, d). Thus, the NUM problem is decomposed into: 
(a) The input rate control problem 



Prob.l : y m.ax{Us{xs) — Xs^s] (16) 

solved for each commodity flow at the incoming nodes independently Xs = Ug^^ (A^). Observe that by 
assumption Us{xs) is continuous and monotone decreasing in M+ (thus a bijection) and the inverse of 
the function exists. Since Us {xg) is strictly concave the solution is unique for each A^. 
(b) The scheduling problem 



Prob.2 : maxg^er En,d >i {^km=n ' Ei:e{i)=n dt) = max dt " (^6(0 - >^i(i)) 

l,d 

< max wi- gi (17) 
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Through each link I the commodity d* = argmax^j ^A^^^^ — A^^^^ j is scheduled to be routed with goodput 

rate gi and wi = max^ (^Xf^^^ — X'^f^iyO^. This is the well known backpressure policy [1]. The solution 
of (flTl) further provides the optimal multicommodity goodput flow variables {g^}. The optimal solution 
described is very similar to the DRPC policy in [2]. 

If we can solve ([T7]) distributedly, then algorithms can be provided, that solve the dual problem 
min L ^A j also in a distributed manner, and converge to the optimal average incoming rate vector x* 
and average price vector A*. The dual problem can be solved by the subgradient method. The prices A^ 
for each node-destination pair (n, d) are step-wise adjusted by 



Kit+l) = [X'At) + J, ■ (x'At) - 9t{t)+ E 9t{tW- (18) 

k:b(k)=n l:e{l)=n 
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In the above 7^ is a positive scalar stepsize, [. . .] denotes the projection onto the set ]R_|_ and for 
each t the values x'^ (t) and g"^ (t) are calculated by solving problems (fT6l) and (fTTl) respectively and 
using prices A(t). As noted in the aforementioned works, in practice a constant stepsize is used for 
implementation purposes, although the convergence of the algorithm is guaranteed for 74 0. For 
constant stepsizes statistical convergence to X*,x* is guaranteed as shown in [15, Def.l,Th.2]. 

VI. The Scheduling Problem 

A. Relaxation 

As was mentioned previously it is very important that the problem in (flTI) is solved in a distributed 
manner. To this aim the theory of supermodular games can be used. We make the following two 
assumptions 

1) Assumption 1: Each origin node chooses a single end node to transmit 

2) Assumption 2: Each node can transmit and receive at the same time 

3) Assumption 3: Fixed scheduled rates per link /i; are considered. 

Specifically the last assumption constraints the generality of the initial model but is necessary for the 
approach that follows. Variable scheduled rates would involve a joint maximization over power allocation 
and rates. This would complicate the analysis, but is a rather important topic for future research. The 
maximization in (fTTl) can be written as 



max y^ wi-gi = max y^ wi-gi = max m ■ mi {Pi l^i) 

, get ^ pen ^ ^ 

I I n:n=b(l) n:n=e{l) 

= max y] Wn ■ flnQn {P, ^n) (19) 

n:n=b(l) 

where (a) comes from the fact that the objective function is linear and the supporting hyperplanes to the 
sets f and T = co |f | are the same, while (b) from Assumption 1. The latter simplifies the problem 
to a weighted sum maximization problem with number of summants equal to the number of nodes 
and allows the formulation of a noncooperative game in the following subsections, where each node 
decides independently over its transmitting power through a single chosen link. The capacity region of 
the system in Theorem [T] is of course reduced. An important question is how each node choses the 
optimal single link to transmit. 

The number of links with node n as origin are all links {I E C : n = b{l) & > 0}. This is the 
connectivity set of node n. The optimal link is obviously the one which provides maximum weighted 
goodput to the above summation, for a given power allocation pn E [-P^™, P^^^] ■ 
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Departing here briefly from the main line of the analysis, we end this subsection with a heuristic 
suggestion for an almost optimal choice of a single receiver node, using low information exchange 
between the network nodes. Applying Markov's inequality in ([T]) 



um¥ {SINRi (p) > e^' - 1) < {SINRi (p)) . 

Suppose that the end node e (/) of each link / measures the received level of interference, the latter 
denoted by X;. This is not any more a random variable with unknown realization but rather a known 
deterministic quantity. The right handside then reduces to 

uifii E (Gi^iFi^ipi) _ uifii Gi^ipi ^20^) 



[-^l + ^e{l) ) [-^l + ^e(0 

This is an upper bound on the actual error probability. The process of the sub-optimal choice then is as 
follows. Each destination node of links belonging to the connectivity set of node n, informs the origin 



node over Xi + cr^^;^ and afterwards n chooses to transmit over the link with the maximum ratio (1201) . 
since Gi^i, coi and fii are known to n. 

An alternative way to choose a single receiver node could be by assigning to each element of the 
connectivity set a probability, with sum equal to one per transmitting node, and the choice will then be 
a random process. 

B. Optimality conditions 

Using the Karush-Kuhn-Tucker (KKT) optimality conditions and observing that the active inequality 
constraint gradients are linearly independent [27, pp. 315-317] all feasible vectors p'are regular and we 
have the following necessary conditions for to be a local maximizer of the objective function in (fT9l) . 

= (p , z/ , z/ ) = z/„ - + a;„ ■ /i„ ^- + 2_^uJm- f^m g- (21) 

for each n and the complementary slackness conditions satisfy 

L (jf, z7', z7") is the Lagrangian of the problem in (fT9l ). The conditions are only necessary and not 
sufficient. We make here the remark that if the objective function were concave, the dual gap would be 
zero and any local maximizer would also be global for the problem at hand. In this case the conditions 
would also be sufficient. Unfortunately this generaly does not hold for the specific objective function. 
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Divide dlD and (I22l) by g„ (f^fin) (which is definitely positive if we choose P^'"^ > 0, Vn G Af) 
and then - approching the problem similarly to [16] - set 

n ^ 9q„, {p, 

0>UJm- fira ^ 7^ T = 7^ T (23) 

= T^m,n (P) (24) 

and for the Lagrange multipliers 

z>" = > & z>i = ,i , > 0. (25) 

With the above substitutions the per node condition in (|2TI) is rewritten as 



""■'^" ,„(p->.,„) ap„ (26) 

Then (|26l) with the related complementary slackness conditions are the necessary and sufficient condi- 
tions for to be the global maximizer of the problem 



maXp„ Un^n log (g„ {Pn, yUn) ) + Pn Y.mjtn ^rn,n (P*) (27) 

since by property P.l of the success function, logg„ {p^ l^n) is concave in the constraint set p„ e 
[P™™, P™*^""] is convex and compact and Slater's condition holds true. This explains now why the 
division in (l23l) and (l25l) was required. 

C. A Supermodular Game 

If we view — tt^ „ (p) as the price charged by user m to user n for affecting its goodput by 
creating interference, we can approach the solution to the optimal power allocation problem in a 
distributed fashion with the use of game theory. We denote the noncooperative game by the triple 
Q = (A/", n, {Jn {■) , n E A/"}) where TV are the players, 11 is the set of feasible joint strategies and J„ 
is the payoff function for user n. 

We distinguish between two types of players. First, the power players who belong to the set A/"^, 
each one of which represents a node and the set of feasible joint strategies 11^ is identical to the set 11 
of feasible power allocations. Their payoff function equals 



Jn {P )) 

log (g„ + p„ ■ ^ 



(28) 
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We often set c„ := XlmT^n ^m," emphasize the dependence of J„ on the sum instead of the individual 
prices. The best response correspondence for player n is the set 



Yn [p-n) = arg max J„ (p„; (Tr^.n)) 
Pnen„(p_„) 



(29) 



where n„ (p_„,) = {p„ : (p„,p_„) G n^}. 

Second, the przce players who belong to the set A/'^'' := {(m, n) : m ^ n, m,n e M} with cardinality 
iV X (A^ — 1). The feasible set of strategies for player (m, n) is 



jjpr 



min TTra,n (p) , 

pen 



(30) 



where Tr^.n (p) is given in (|24l) . The best response for a price player is denoted by (following [16]) 



^m^ = arg max^ - {n^,n - ^m,n {p)f (31) 

and n^'' = {11(2,1), • • • 5n(7v_i AT)} is the joint feasible set. 

A Nash equilibrium (NE) for the game Q is defined as the set of power vectors jf = (pi, . . . ,p^) 
and price vectors tt^ = {^2,17 • • • ; 7r^,i) • • ■ ; „, . . . , with the property for every n,m E and 

every (m, n) E A/"^*" 



Jn {Pn^f-n^ (^m,n)) > -^n {Pn^f-n^ {Ka,n)) > & ^T^.n = T^m,n {f) , Vp„ E Tin {f-n) (32) 

Hence p'^ belongs to the best response correspondence of player n, Vn E M^, given the equilibrium 
prices, whereas vr^ „ belongs to the best response correspondence of player (m, n) E M'^''' given the 
equilibrium powers. 

The existence and uniqueness of the NE when the prices do not take part as players in the game has 
been proven in [28, Th.III.l] under mild assumptions on the problem parameters usually satisfied in 
practice. In our case however with A^ + A^x(A^ — 1) = A^^ players the uniqueness of a Nash equilibrium 
is not guaranteed. We can however make use of the theory of supermodular games, exploiting the 
structure of the payoff function in (l28l) to find algorithms that converge to one of the Nash Equilibria. 
We first give the definition of a supermodular game from Topkis [25] 

Definition 1 A noncooperative game with N players {A/", 11, {/„ : n E A/"}}, each having strategy 
belonging to the feasible set of strategies n„ (x_„), is supermodular if the set 11 of feasible joint 
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Strategies is a sublattice of and for each n the payoff function is supermodular in player n 's 
strategy x„ and has increasing differences for all pairs (xj, xf) G Ilj x 11^, i ^ j, i, j & M. 

Theorem 4 The noncooperative game with N power players and N x [N — 1) price players is a 
supermodular game [25, p. 178]. Furthermore, the set of equilibrium points is a nonempty complete 
lattice and a greatest and least equilibrium point exist. 

Proof: See Appendix B. ■ 
After proving that the problem at hand has the desired properties so that supermodular game theory can 
be applied we prove in the following that the Nash Equilibria of the game are exactly the power alloca- 
tions that satisfy the KKT necessary optimality conditions of the original sum weighted maximization 
problem. 

Theorem 5 Under the condition that \/n, P™™ > 0, a power vector is a Nash Equilibrium of the 
supermodular power-price game if and only if it satisfies the necessary optimality conditions (|27])-(|22]). 

Proof: See Appendix C. ■ 
The above theorem is rather important because it shows that the formulated game leads to one of 
the solutions of the scheduling problem. If the objective function in (fT9l ) is concave then the NE is 
also unique and the game converges to the unique global maximizer. The suboptimality of the proposed 
scheme in the current work thus lies solely on the fact that the KKT conditions are only necessary but 
not sufficient. If we can define the region of 11 for which the objective function is concave and restrict 
the feasible power allocations to that, the suggested distributed solution is the optimal one. This can be 
a topic for future investigations. 

D. The Scheduling Algorithm 

In the current paragraph we provide an algorithm which updates for each player the power allocation 
Pn and the price 'Km,n- Starting from any initial point within the joint feasible region, the algorithm will 
eventually converge to a NE bounded component- wise by the greatest and least NE. It is related to the 
Round-Robin optimization for supermodular games [25, Ch. 4.3.1], versions of which are suggested in 
[16] and [17]. 

The algorithm has two phases for each iteration t and is given in Table I. The power update phase 

calculates the best response for each user n by (l28l) given fixed prices iXm.n and the opponents' decisions 

-it) 

F-n- 

During the price update phase each user m calculates (A^ — 1) new prices Tim,n (without the hat) by 
(l23l) given the updated power vector. Then all users m ^ n communicate the values T^m,n to user n, 
who divides their sum by g„ (p, //„) to form the new sum price Cn^^"* for the next power update phase. 
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Observe that for each iteration, user n should know: (a) Its own rate of transmission /i„ (which defines 
Qn) and weight u;„, (b) the power profile of the other users (c) the prices Tr^.n communicated by 
the interfering users and (d) the slow fading coefficients Gm.n which depend on the distance between 
the nodes. 

E. Implementation Issues 

Considering implementation issues of the algorithm, information (b) and (c) should be communicated 
to node n, while (d) should be globally known. Notice that communicating the information over the 
power profile of the interfering users will violate the distributed nature of the algorithm. Instead of 
the power vector p_„ however, it suffices for each user to measure the current level of interference 

= Yjva^n GmnFmnPm in wMch CaSC WC WrltC 



qn{Pn,^n,l^n) = ^ [ r > 2 - il^n) = exp 

where / : b(l) = n and the second equality holds for Rayleigh fading. The payoff function will change 
accordingly. In the price update phase observe that the partial derivative of qm with respect to pn will 
be given by 

UJmfJ'm OPn OJ^m OPn ^mfJ^m 

The new values 0^ can be computed by each user m and are independent of the destination user n. 

y^m \Pmi-^m) ^mf^mQm \Pmi-^mjf^n 



In the form (1331) observe that the actual realization of the random variable Fnm appears. Remember that 
Fnm is the fast fading channel power coefficient. This information is unknown. But node n is interested 
in the sum c„ of the prices ■Km,n (see (HTI) ) which can be written as 



^ \ ^ ^ Gnm-Fnm^'m ijPm^'F^ni) (34) 



Qn \Pn ) -^n i l^n, , 



If each node m ^ n broadcasts a sequence of random symbols Sm, \Sm'f = 1 with power (j)m {Pm,^n 
the received signal at node n will be (assuming reciprocity of the channel gains) 



F„ = ^ HnmV (pm (Pm, ^m)Sm + NoisC 
m^n 



(35) 
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and its power is {Y^f = Em^n GnmFnm(pm {Pm, 1m)+crl. If the receiving node n divides by -g„ 
we get a noisy version of the expression in (|34l) . 

The above idea is borrowed from recent works that deal with ways to use the Wireless Multiple 
Access Channel (MAC) in order to compute general functions of data among which is also addition 
[29]. The above method using power to convey information can be found specifically in [30]. From the 
above we realize that although the fast fading coefficients are not known to the users m that have to 
calculate the prices tt^.w these can be revealed to the receiver n within the sum signal in (l34l) . 

Finally rather important is the fact that for the implementation of the algorithm, each user m has to 
be aware of its received interference Im and actually calculate only a single price (pm- Then in a single 
step during the price update phase each player/node broadcasts its price (pm, while acting simultaneously 
as a receiver (remember Assumption 2) to obtain the channel-power-weighted sum of the prices of the 
other — 1 users. The entire network topology is not any more necessary to be known to each user 
m, only the slow fading gain Gmm- This allows the scheduling algorithm to have as well application 
in cases where the topology possibly changes due to user mobility. 

VII. Simulations 

Simulation results of the proposed scheme for congestion control, routing and distributed power 
allocation when hop-by-hop retransmissions are taken into account are presented in Figl6l We used a 
four node topology having two commodity flows with sorce node 1 and destination nodes 3 and 4 
respectively. The congestion control requires the solution of the subproblems (IT6l) and ([TT]) respectively 
with prices A (t). The prices are updated per node using the expression in (fTSl) . The optimal links per 
node are chosen at each step using (l20l) . The scheduling problem in ([T9l) is solved initially by brut force 
(left column) to provide a comparison with the results obtained when the price based algorithm is used 
(right column). We notice that although the uniqueness of the Nash Equilibrium cannot be guaranteed 
the results of the suggested algorithm considering the maximum supported incoming rate as well as the 
queue length (price Ai) are almost optimal. An important remark is that the two solutions would be 
exactly the same if the objective function in ([T9l) would be concave. 

VIII. Appendix 

A. Proof of Theorem |J] 

In the following we will neglect the dependence of functions on variables that are considered constant 
throughout a proof. 

. PM: Suppose > pi and let ^'l := argmax^^e^^ /i^g/ ^j), and also Jii := argmax^^g^^ fnqi {pi, ///). 
Then V/i; G M 
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(a) (b) 

i^Iqi {vt^^i) > mi (pt^i^i) > mi (Phf^i) 

In the above, (a) comes from the definition of /i/^ and (b) from P.l of the success probability 
function. Since the inequality holds Vfii it also holds for fii = fii, hence gi (pl^P-i) > gi {pi,p-i). 
P'.2: For the monotonicity we proceed as above, where p^ > p^, := argmax^^gyvi ^iqi (p^, //;), 
and also Jii := argmax^^gyn niqi {pk, Hi). Then V/x^ G M 

mi {Pk,fj'i) > mi {Pk,fj'i) > mi [pt^f^i) 

where (c) comes from the definition of /i/^ and (d) from the monotonicity in P.2. Since the inequality 
holds VyUi it also holds for fn = ^j, hence gi {p^,p-k) < Qi {pk,P-k)- 
For the convexity we write for p^^^ ^ pf^ 



(P2) 

gi(ep^^^ + {i-e)pf) = m^^xf,iqi{ep^^ + {i-e)p^^\t,i) < 

rnax ^^OfiiQi (^p^^'' , fii^ + (1 - 6) fnqi (^pf ' } - "]f^ ^mi (Pk'' ^ l^i) + "]fx (1 - 0) fnqi [pf'' , ni 

= egi{pi\p.k) + il-0)gi{p?\p^ 



P'.3 Choose pf > pf and denote n'^ := argmax^jg_A4 mi {p\-, /^O' ^'^^ argmax^jg_A4 mi (Pi- 

By definition 

H'l Ql [Pi 1 fJ-i ) 

We prove the property by contradiction. Suppose that /i^* < ^f. From the log-supermodularity 
property P.4 

Qi {pI f^t) ^ 11 {pI /^f) .3^. 

QdPt,f^t) qiipt^f^l) 

Combining ^ and dBV]) 

m q,{p\^^,^) m qdpt,^^t) , . 

— > — -r-^, — ^ > — 1-^ — ^ =^ f^iQi[PiifJ-i) > fJ-iQi{PiilJ-i) i^o) 
Qi{Phf^i) <li[Pt,fJ'i) 

But 081) is impossible from the definition of nf := argmax^^g^vj niqi (p", hence > /x". 
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• P'.4: We make use of the fact that given a pair ypf^\ fi^j^^j there always exists another one 

(^p\'^\fiPy with p}^^ ^ and /i^^ ^ /if^ such that qi (^p\^\p^i, fi\^''^ = qi (pf\p^i, ^xf^^. 
This is because V/x; G M, the success probability function qi {pu^i) G [0, 1] is strictly increasing 
in pi and strictly decreasing in by P.l and P.3 (here pi G M+). 

Denote by ^\ := aigmax^^^M mi ^"^^ •= argmax^^gyK {Pk^fJ'i)- 

Using the above fact we can write qi {pi,pt, fif) = qi {pf,pt, and qi [pupl, /if) = qi [p^pl, ^ii), 

e.g. for some /i; > max {/if , /if}, > and p\ > pi. By definition 

/i[ ^ g< jPhPlf^t) ^ Qi {pt,Pll^i) ^3^^ 

/^r ~ qi {phPi,ijf{) qi [plplf^i] 

The property is proven by contradiction.. Choose pi > pi and suppose that 

(P3) 

/if > /if ^ Qi {phpl f4) < qi {phpl f^t) ^ 

qi{p\,pll^i)<qi{plpll^i) p'<Pi (40) 

From the log-supermodularity property P.5 we reach the inequality /ifg/ {phPl^ /*f) > f^fqi (PhPk^ /^?) 
which is impossible by the definition of ^1 ^ p\ < pi is impossible <^ ^\ > j^i is impossible. 

B. Proof of Theorem 6 

The set of joint feasible strategies 11^ x 11^'' G is a sublattice of M^^. The set is also compact since 
for a power player n G M^, Pn G [-P™°, -P™^""] while for a price player „ G [minp-gn? tt^,™ (p) , 0] 
and the lowest endpoint of the interval is > — oo for P™"^ > e > 0, Vn (see the expression for the 
success probability ^ and its derivative (fTTI) . 

Since the set of feasible strategies for power player n is a compact subset of M} the payoff function 
in (|28] ) is supermodular in p„. We have further seen in property P.5 that the logarithm of the success 
probability function of user n has increasing differences for each pair {pn,Pm) ,Vm ^ n and constant 
differences for each pair {pmi,Pm2)^ 7^ ""1^2 7^ n. Then logg„ (p, /^n) is supermodular in p G 11^. 
Observe that p„ ■ J2m^n'^in,n is also supermodular and by property [25, Lemma 2.6.1(b)] the sum of 
supermodular functions is supermodular. We reach the conclusion that J„ = UnUn ■ log q-n (p, fJ'n) + CnPn 
has increasing differences in all pairs {pi,Pj) for distinct i,jE M^. 

The expression for J„ in (|28l) is a valuation (has constant differences) for each pair (7rj„,7rj„), 
i,j 7^ n. Finally for the pairs (pi,ifj^n) the function has also increasing differences if i = n and is 
a valuation for i ^ n. Then we reach the conclusion that J„ has increasing differences for each pair 
{xi,Xj), i,j G M^^JM^^. By definition [T] the game is supermodular. 
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The set of feasible joint startegies 11^ x 11^^ is shown to be compact. Observing that the expression in 
Q and hence J„ is continuous in p„ and having proven that the game is supermodular, the 'Furthermore' 
part of the theorem comes from [25, Th.4.2.1]. 

C. Proof of Theorem 7 

The 'if part comes directly from the way the extended supermodular game was formulated. For the 
'only if part we argue as follows. Suppose is a Nash Equilibrium of the problem. Then for each n 



Pn = arg maXp^g[pmi„^pmax] <^ log g„ [pn] f-n-, l^n) + TT^.n 



= arg max grp^in p,^axi <^ log g„ ^„ ) + Pn > — r ^ > 

^ " " ^ [ ^^<ln{p^,f^n) dpn J 

Unfin log Qn [Pn] P-„, fJ-n) ' Qn [P , ^n) + Pn ^ ^^mfJ-m o 

The necessary and sufficient optimality conditions for the above problem are 

dqn {Pn, f^n) Qn , fin) , dq^ , f^m) , / u n 

U^nl^n J =g T + > UJmfim 7^ + Z/„ - Z/„ = 

OPn qn{Pn,f-nitJ'n) OPn 

^n ■ {pr-Pn) > & < • (p„ - prn > o 

Since p^ is the global maximizer (remember that the objective function is concave) the necessary 
conditions (|2TI)-(|22|) of the scheduling problem are satisfied Vn. 
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Wireless Node 3 



Fig. 1. An example of the wireless network with a single commodity d — 4. Detail of node 3 
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Maximum goodput g,(p,,P2) vs. power p, Optimal Rate n/p^.p^) vs power p, 

I.Sl 1 1 1 1 1 1 ! 

0.7 I 1 1 1 1 1 1 ! 1 1 1 




Fig. 2. We simulate a 2-user Rayleighi/Rayleigii fading chiannel with set of rates A4 = {0.4, 0.8, • . • , 2} and P2 = 5 Watt, Pi = 20 
Watt. Properties P'.l and P'.3 are illustrated a. Maximum Goodput (7i(pi,P2) vs power p\, b. Optimal rate pi (pi,P2) vs power p\. 



Maximum goodput g^Cp^jPj) vs power p^ Maximum rate ^^{p^iPj) vs Power p^ 




5 10 15 20 25 5 10 15 20 25 

Power Power 



Fig. 3. We simulate a 2-user Rayleigh/Rayleigh fading channel with set of rates M — {0.4, 0.8, . . . , 2} and p2 — 20 Watt, Pi = 25 
Watt. Properties P'.2 and P'.4 are illustrated a. Maximum Goodput gi{pi,p2) vs power p2, b. Optimal rate fli (pi,p2) vs power p2. 
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Goodput Region r for a network with 2 transmitting + 1 receiving node 



0.05 0.1 



0.15 0.2 0.25 0.3 0.35 05 

max^ (1^ q,(P,,Py|J,) 



Fig. 4. The 2-user Rayleigh/Rayleigh goodput region Fi and Fi for the network of 2 transmitters and a single receiver. The convex 
hull is shown with the dashed dot lines. For the illustration M = {0.4, 0.8, . . . , L8}, P"''"' = 2 Watt, Pa™'''' = 3 Watt and the success 
probability function in ^ has been used with Gi,i = Gi,2 ~ 1, G2,2 = G2,i = 1. 



Goodput Region r for a networit witfi 1 transmitting + 2 receiving nodes 




Fig. 5. The 2-user Rayleigh/Rayleigh goodput region f 2 and r2 for the network consisting of 1 transmitter and 2 receivers. The convex 
hull is shown with the dashed dot lines. For this topology, M — {0.2,0.4,0.6}, pi + P2 < P = 10 Watt and the success probability 
function in ^ has been used with Gi,i = G2,2 = 1, Gi,2 ~ 0.5, Ga.i ~ 0.8. 
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Distributed Algorithm for the Scheduling Problem 

INITIALIZE 

• Choose the least element of 11^ x 11^^: (-P™™) for the power players and (minp-gnp 7i"m,n (p)) for 

the price players. 
. Set t = 0, A; = 0. 
REPEAT 

1) Power Update: For k = 1,. . . ,N 

, Given (ni]r?j and p<*''^-i) 

(t,k) j(t) ( -Jt,k-i)\ 



where Jk is given in (I28I) . 

-(t,k) -(t,k~l) 

2) Price Update: 

• For A; = 1, . . . , A^. Given each user k updates the A — 1 prices vTfc „ for k ^ n 

f^t,N)\ % {^''^\f^k) 

'^k,n (F ') =^k- IJ'k 

OPn 

and communicates them to user n 

• Each user n receives A — 1 prices tt^ „ and calculates 



^) = E^m(p^*''^^) (41) 



3) Increase t by 1 

. Set p-<*'0) = p-<*-i'^). Set (vrSn) = (7r„,„ (pi*-!-^))) 

UNTIL p<*'0) =p-<*-i'0) and (ttS^) = (tt^'^) 
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commodity! z^destinationl /destination2 

Iink1 ^ link 2 links 



M ^ >(3 ) 44 

commodity2''^'^ li"l<4 



> 

links 



links 




Fig. 6. Congestion control for a four node topology with two commodity flows. The scheduling problem is solved using the suggested 
pricing algorithm. Comparison plots with a brut force search to find the global optimum of the weighted sum maximization problem in 
l |19l l are provided in the first column. 



